
67 research outputs found

What is speech rhythm? A commentary inspired by Arvaniti & Rodriquez, Krivokapić, and Goswami & Leong.

Author: Shattuck-Hufnagel Stefanie
Turk Alice
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/05/2013

Edinburgh Research Explorer

Recommended from our members

Chapter 2: The Original ToBI System and the Evolution of the ToBI Framework

Author: Beckman Mary E.
Hirschberg Julia Bell
Shattuck-Hufnagel Stefanie
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2004

In this chapter, the authors try to identify the essential properties of a ToBI framework annotation system by describing the development and design of the original ToBI conventions. In this description, they give an overview of the general phonological theory and the specific theory of Mainstream American English intonation and prosody that they decided to incorporate in the original ToBI tags. They also state the practical principles that led them to make the decisions that they did. The chapter is organised as follows. Section 2.2 briefly chronicles how the MAE_ToBI system came into being. Section 2.3 briefly describes the consensus account of English intonation and prosody on which the MAE_ToBI system is based. Section 2.4 catalogues the different components of a MAE_ToBI transcription and lists the salient rules which constrain the relationships between different components. This section also expands upon the theoretical foundations and practical consequences of adopting the general structure of multiple labelling tiers, and particularly the separation of the labels for tones from the labels for indexing prosodic boundary strength. Section 2.5 then describes some of the extensions of the basic ToBI tiers that have been adopted by some sites. This section also compares the authors' decisions about the number of tiers and about inter-tier constraints with the analogous decisions for some of the other ToBI systems described in this book. Section 2.6 discusses the status of the symbolic labels relative to the continuous phonetic records that are also an obligatory component of the MAE_ToBI transcription. Section 2.7 then closes by listing several open research questions that the authors would like to see addressed by MAE_ToBI users and the larger ToBI community.

Columbia University Academic Commons

Hierarchical distinctions in the production and perception of nuclear tunes in American English

Author: Cole Jennifer
Shattuck-Hufnagel Stefanie
Steffman Jeremy
Tilson Sam
Publication venue: 'Open Library of the Humanities'
Publication date: 02/06/2023

Edinburgh Research Explorer

Prosodic Effects of Discourse Salience and Association with Focus

Author: Breen Mara
Flemming Edward
Gibson Edward A.
Shattuck-Hufnagel Stefanie
Wagner M.
Publication venue: International Speech Communication Association (ISCA)
Publication date: 01/05/2010

Three factors that have been argued to influence the prosody of an utterance are (i) which constituents encode discourse-salient information; (ii) which constituents are contrastive in that they evoke alternatives; and (iii) which constituents interact with the meaning of focus operators such as only (i.e., they ‘associate’ with focus). One challenge for a better understanding of these factors and their interaction has been the difficulty of finding a way to evaluate hypotheses quantitatively, since individual variation in productions is often large enough to wash out experimental effects. In this paper, we apply a methodology introduced in [1] to control for such variation and present evidence for how the three factors interact to influence prosody in sentences containing single or multiple foci.

DSpace@MIT

Three steps forward for predictability: Consideration of methodological robustness, indexical and prosodic factors, and replication in the laboratory

Author: Docherty Gerry
Foulkes Paul
Hughes Vincent
Shattuck-Hufnagel Stefanie
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2018

There is now abundant evidence that phonetic forms are shaped by probabilistic effects reflecting predictability or informativity. We outline a number of challenges for such work, where theoretical claims are often based on small differences in acoustic measurements, or interpretations of small statistical effect sizes. We outline caveats about the methods and assumptions encountered in many studies of predictability effects, particularly regarding corpus-based approaches. We consider the wide range of factors that influence patterns of variability in phonetic forms, taking a broad perspective on what is meant by “the message” in order to show that predictability effects need to be considered alongside many others, including indexical and prosodic factors. We suggest a number of ways forward to extend our understanding of the form-predictability relationship.

Crossref

White Rose Research Online

Speech Communication

Author: Klatt Dennis H.
Perkell Joseph S.
Shattuck-Hufnagel Stefanie
Stevens Kenneth N.
Zue Victor W.
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date: 01/01/1985

Contains reports on five research projects. C.J. LeBel Fellowship; Kurzweil Applied Intelligence; National Institutes of Health (Grant 5 T32 NS07040); National Institutes of Health (Grant 5 R01 NS04332); National Science Foundation (Grant 1ST 80-17599); Systems Development Foundation; U.S. Navy - Office of Naval Research (Contract N00014-82-K-0727)

DSpace@MIT

Speech Communication

Author: Klatt Dennis H.
Perkell Joseph S.
Shattuck-Hufnagel Stefanie
Stevens Kenneth N.
Zue Victor W.
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date: 01/01/1983

Contains reports on eight research projects. C.J. LeBel Fellowship; Systems Development Foundation; National Institutes of Health (Grant 5 T32 NS 07040-08); National Institutes of Health (Grant 5 R01 NS 04332-20); National Science Foundation (Grant 1ST 80-1759); National Science Foundation (Grant 1ST 80-17599 and MCS-8112899); U.S. Navy - Office of Naval Research (Contract N00014-82-K-0727)

DSpace@MIT

Speech Communication

Author: Klatt Dennis H.
Perkell Joseph S.
Shattuck-Hufnagel Stefanie
Stevens Kenneth N.
Zue Victor W.
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date: 01/01/1984

Contains reports on eight research projects. C.J. LeBel Fellowship; Systems Development Foundation; National Institutes of Health (Grant 5 T32 NS07040); National Institutes of Health (Grant 5 R01 NS04332); National Science Foundation (Grant 1ST 80-17599); U.S. Navy - Office of Naval Research (Contract N00014-82-K-0727)

DSpace@MIT

Speech Communication

Author: Klatt Dennis H.
Perkell Joseph S.
Seneff Stephanie
Shattuck-Hufnagel Stefanie
Stevens Kenneth N.
Zue Victor W.
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date: 01/01/1986

Contains reports on four research projects. C.J. LeBel Fellowship; Kurzweil Applied Intelligence; National Institutes of Health (Grant 5 T32 NS07040); National Institutes of Health (Grant 5 R01 NS04332); National Science Foundation (Grant BNS84-18733); Systems Development Foundation; U.S. Navy - Office of Naval Research (Contract N00014-82-K-0727)

DSpace@MIT

Lexical Access Model for Italian -- Modeling human speech processing: identification of words in running speech toward lexical access based on the detection of landmarks and other acoustic cues to features

Author: Arango Javier
Chan Ian
Choi Jeung-Yoon
De Nardis Luca
DeCaprio Alec
Di Benedetto Maria-Gabriella
Shattuck-Hufnagel Stefanie
Publication date: 01/01/2021

Modelling the process that a listener actuates in deriving the words intended by a speaker requires setting a hypothesis on how lexical items are stored in memory. This work aims at developing a system that imitates humans when identifying words in running speech and, in this way, providing a framework to better understand human speech processing. We build a speech recognizer for Italian based on the principles of Stevens' model of Lexical Access in which words are stored as hierarchical arrangements of distinctive features (Stevens, K. N. (2002). "Toward a model for lexical access based on acoustic landmarks and distinctive features," J. Acoust. Soc. Am., 111(4):1872-1891). Over the past few decades, the Speech Communication Group at the Massachusetts Institute of Technology (MIT) developed a speech recognition system for English based on this approach. Italian will be the first language beyond English to be explored; the extension to another language provides the opportunity to test the hypothesis that words are represented in memory as a set of hierarchically-arranged distinctive features, and reveal which of the underlying mechanisms may have a language-independent nature. This paper also introduces a new Lexical Access corpus, the LaMIT database, created and labeled specifically for this work, that will be provided freely to the speech research community. Future developments will test the hypothesis that specific acoustic discontinuities - called landmarks - that serve as cues to features, are language independent, while other cues may be language-dependent, with powerful implications for understanding how the human brain recognizes speech. Comment: Submitted to Language and Speech, 202

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza